Shrink and Eliminate: A Study of Post-Training Quantization and Repeated Operations Elimination in RNN Models

Abstract

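Recurrent neural networks (RNNs) are neural networks (NN) designed for time-series applications. There is a growing interest in running RNNs to support these applications on edge devices. However, RNNs have large memory and computational demands that make them challenging to implement on edge devices. Quantization is used to shrink the size and the computational needs of such models by decreasing the precision of weights and activations. Further, the delta networks method increases the sparsity in input and activation vectors by relying on the temporal relationship between successive input sequences to eliminate repeated computations and memory accesses. In this paper, we study the effect of quantization on LSTM-, GRU-, LiGRU-, and SRU-based RNN models for speech recognition on the TIMIT dataset. We show how to apply post-training quantization to these models with a minimal increase in the error by skipping quantization of selected paths. In addition, we show that quantizing the activation vectors in RNNs to integer precision leads to considerable sparsity if the delta networks method is applied. Then, we propose a method for increasing the sparsity in the activation vectors while minimizing the error and maximizing the percentage of eliminated computations. The proposed quantization method managed to compress the four models by more than 85%, with an error increase of 0.6, 0, 2.1, and 0.2 percentage points, respectively. By applying the delta networks method to the quantized models, more than 50% of the operations can be eliminated, in most cases with only a minor increase in the error. Comparing the four models to each other under the quantization and delta networks methods, we found that compressed LSTM-based models are the best solutions under low-error-rate constraints. The compressed SRU-based models are the smallest in size, suitable when higher error rates are acceptable, and the compressed LiGRU-based models have the highest number of eliminated operations.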
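The two ideas named in the abstract, reducing weight precision and skipping computations for inputs that have not changed between time steps, can be illustrated with a minimal NumPy sketch. This is not the paper's method: it assumes plain uniform symmetric quantization and a generic thresholded delta update for a single matrix-vector product. The function names `quantize_symmetric` and `delta_matvec_sequence`, the 8-bit setting, and the threshold value are all hypothetical choices made for illustration.

```python
import numpy as np


def quantize_symmetric(w, num_bits=8):
    """Uniform symmetric quantization (assumed scheme, not the paper's).

    Returns integer codes and the scale such that w is approximately q * scale.
    """
    qmax = 2 ** (num_bits - 1) - 1
    max_abs = float(np.max(np.abs(w)))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = np.clip(np.round(w / scale), -qmax, qmax).astype(np.int32)
    return q, scale


def delta_matvec_sequence(W, xs, threshold):
    """Compute an approximation of W @ x_t for every time step t.

    Only input elements that changed by more than `threshold` since they
    were last used trigger work; the other columns of W are skipped.
    Returns the outputs and the fraction of skipped input elements.
    """
    n_out, n_in = W.shape
    x_ref = np.zeros(n_in)   # last input value actually propagated
    y = np.zeros(n_out)      # running output, updated incrementally
    outputs = []
    skipped = 0
    for x in xs:
        delta = x - x_ref
        active = np.abs(delta) > threshold
        # Update the output using only the columns with a large change.
        y = y + W[:, active] @ delta[active]
        # Remember the propagated values so small changes can accumulate.
        x_ref = np.where(active, x, x_ref)
        skipped += int(n_in - active.sum())
        outputs.append(y.copy())
    return np.stack(outputs), skipped / (len(xs) * n_in)


# Small demonstration on random data (values are illustrative only).
rng = np.random.default_rng(0)
W_demo = rng.normal(size=(4, 6))
q_demo, s_demo = quantize_symmetric(W_demo, num_bits=8)
print("max quantization error:", np.max(np.abs(q_demo * s_demo - W_demo)))

# A slowly varying input sequence: many elements barely change per step.
xs_demo = np.cumsum(rng.normal(scale=0.05, size=(10, 6)), axis=0)
_, frac_demo = delta_matvec_sequence(W_demo, xs_demo, threshold=0.05)
print("fraction of skipped input elements:", frac_demo)
```

With a threshold of zero the delta update reproduces the exact product at every step and skips only elements that are strictly unchanged. Raising the threshold skips more work at the cost of an approximation error, which mirrors the trade-off between eliminated operations and error rate that the abstract describes.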

Journal

Journal title: Information

Year: 2022

ISSN: 2078-2489

DOI: https://doi.org/10.3390/info13040176